5 research outputs found
A Comprehensive Review of YOLO: From YOLOv1 and Beyond
YOLO has become a central real-time object detection system for robotics,
driverless cars, and video monitoring applications. We present a comprehensive
analysis of YOLO's evolution, examining the innovations and contributions in
each iteration from the original YOLO to YOLOv8 and YOLO-NAS. We start by
describing the standard metrics and postprocessing; then, we discuss the major
changes in network architecture and training tricks for each model. Finally, we
summarize the essential lessons from YOLO's development and provide a
perspective on its future, highlighting potential research directions to
enhance real-time object detection systems.Comment: 31 pages, 15 figures, 4 tables, submitted to ACM Computing Surveys
This version includes YOLO-NAS and a more detailed description of YOLOv5 and
YOLOv8. It also adds three new diagrams for the architectures of YOLOv5,
YOLOv8, and YOLO-NA
Land Cover Image Classification
Land Cover (LC) image classification has become increasingly significant in
understanding environmental changes, urban planning, and disaster management.
However, traditional LC methods are often labor-intensive and prone to human
error. This paper explores state-of-the-art deep learning models for enhanced
accuracy and efficiency in LC analysis. We compare convolutional neural
networks (CNN) against transformer-based methods, showcasing their applications
and advantages in LC studies. We used EuroSAT, a patch-based LC classification
data set based on Sentinel-2 satellite images and achieved state-of-the-art
results using current transformer models.Comment: 7 pages, 4 figures, 1 table, published in conferenc
Loss Functions and Metrics in Deep Learning
One of the essential components of deep learning is the choice of the loss
function and performance metrics used to train and evaluate models. This paper
reviews the most prevalent loss functions and performance measurements in deep
learning. We examine the benefits and limits of each technique and illustrate
their application to various deep-learning problems. Our review aims to give a
comprehensive picture of the different loss functions and performance
indicators used in the most common deep learning tasks and help practitioners
choose the best method for their specific task.Comment: 53 pages, 5 figures, 7 tables, 86 equation
Dynamic Measurement of Portos Tomato Seedling Growth Using the Kinect 2.0 Sensor
Traditionally farmers monitor their crops employing their senses and experience. However, the human sensory system is inconsistent due to stress, health, and age. In this paper, we propose an agronomic application for monitoring the growth of Portos tomato seedlings using Kinect 2.0 to build a more accurate, cost-effective, and portable system. The proposed methodology classifies the tomato seedlings into four categories: The first corresponds to the seedling with normal growth at the time of germination; the second corresponds to germination that occurred days after; the third category entails exceedingly late germination where its growth will be outside of the estimated harvest time; the fourth category corresponds to seedlings that did not germinate. Typically, an expert performs this classification by analyzing ten percent of the randomly selected seedlings. In this work, we studied different methods of segmentation and classification where the Gaussian Mixture Model (GMM) and Decision Tree Classifier (DTC) showed the best performance in segmenting and classifying Portos tomato seedlings
Dynamic Measurement of Portos Tomato Seedling Growth Using the Kinect 2.0 Sensor
Traditionally farmers monitor their crops employing their senses and experience. However, the human sensory system is inconsistent due to stress, health, and age. In this paper, we propose an agronomic application for monitoring the growth of Portos tomato seedlings using Kinect 2.0 to build a more accurate, cost-effective, and portable system. The proposed methodology classifies the tomato seedlings into four categories: The first corresponds to the seedling with normal growth at the time of germination; the second corresponds to germination that occurred days after; the third category entails exceedingly late germination where its growth will be outside of the estimated harvest time; the fourth category corresponds to seedlings that did not germinate. Typically, an expert performs this classification by analyzing ten percent of the randomly selected seedlings. In this work, we studied different methods of segmentation and classification where the Gaussian Mixture Model (GMM) and Decision Tree Classifier (DTC) showed the best performance in segmenting and classifying Portos tomato seedlings